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and lip (abbreviated as TxRIla and TxRIip respectively, and TxRIls representing both, hereinafter) , from human pla- 
centa cDNA libraries, and filed it as the patent application Japanese Patent Application No. Hei 10-310422. In the 
process for screening factors binding to one of inhibitors of apoptosis protein in humans, XIAP (X-linked inhibitor of 
apoptosis protein), genes encoding these novel proteins were isolated. Present inventors have analyzed the putative 
amino acid sequence from cDN A for the protein, and discovered the presence of the same amino acid sequence Cys- 
SeCys-Gly-termination codon (TAA) as that in the known TxR near the C-terminus (SEQ ID NOs: 3 and 5). Furthermore, 
they have clarified the SECIS which brings about the insertion of this selenocysteine by comparing it with SECISs 
reported for other selenoproteins described above and 3'UTR of TxRIla to accomplish this invention. Thus, the present 
invention provides the following SECIS, selenoprotein expression vectors using the SECIS, and uses thereof: 

[1] a DNA comprising the nucleotide sequence as defined in SEQ ID NO: 1 ; 

[2] a DNA comprising the nucleotide sequence as defined in SEQ ID NO: 1 , wherein one or more nucleotides are 
substituted, deleted, added, and/or inserted, that inserts selenocysteine for the TGA codon contained in the trans- 
lational region located upstream of said DNA; 

[3] a DNA that hybridizes under stringent conditions to a DNA having the nucleotide sequence as defined in SEQ 
ID NO: 1 , and that inserts selenocysteine for the TGA codon contained in the translational region located upstream 
of said DNA; 

[4] a DNA that hybridizes to the nucleotide sequence as defined in SEQ ID NO: 1 and that has a chain length of 
at least 15 nucleotides; 

[5] a selenoprotein expression vector comprising the following elements: 

==,-— a) a cloning site,4o.which a DNA encoding a selenoprotein amino acid sequence can be inserted, 

b) the DNA of any one of [1], [2], and [3] ligated downstream of said cloning site, and 

c) a regulatory region required for protein expression; 

[6] the selenoprotein expression vector of [5], further comprising a DNA encoding a selenoprotein amino acid 
sequence inserted at the cloning site ; 

[7] a transformant transformed with the selenoprotein expression vector of [6]; 

[8] a method for preparing a selenoprotein comprising the steps of culturing a transformant of [7] and collecting 
the selenoprotein; 

[9] a method for preparing a selenoprotein of [8], wherein the selenoprotein is an enzyme containing a seleno- 
cysteine in its active center; 

[10] the method for preparing a selenoprotein of [9], wherein said enzyme containing selenocysteine in its active 
center is thioredoxin reductase; 

[1 1 ] a DNA that is antisense to the DNA of [1 ] or a portion thereof; and 

[1 2] a method of screening for substances that regulate the expression of selenoproteins, comprising the steps of: 

a) bringing candidate compounds into contact with the transformant of [7], 

b) measuring the selenoprotein contained in the host cells or the culture supernatant thereof and comparing 
it with controls, and 

c) selecting a compound that alters the expression level of selenoproteins by comparison with controls. 

[0009] SECIS of the present invention can be chemically synthesized according to the nucleotide sequence as de- 
fined in SEQ ID NO: 1, or can be obtained from cDNAs of human TxRIla or TxRIip. These cDN As are contained in the 
Human Placenta MATCHMAKER cDNA library, a commercial placenta cDNA library from CLONTECH. Therefore, it 
is easy to obtain a required region by performing PCR, using primers enabling specific amplification of the required 
region. Nucleotide sequences of human TxRIla and TxRIip cDNA are set forth in SEQ ID NOs: 2 and 4. 
[0010] Certain variations in the nucleotide sequence of SECIS are also included in the present invention, inasmuch 
as these modified SECISs are capable of inserting selenocysteine for the TGA codon to be translated into seleno- 
cysteine, which is contained in the translational region of selenoprotein located upstream of the SECIS. Specifically, 
the stem-loop structure (Fig. 4) formed by the mRNA which is transcribed, based on the DNA set forth in SEQ ID NO: 
1 , bears a critical role in the selenocysteine insertion. Therefore, variations in the nucleotide sequence that do not alter 
this structure a great deal are included in this invention. For example, DNAs comprising the nucleotide sequence 
according to SEQ ID NO: 1 , wherein one or more nucleotides are substituted, deleted, added, and/or inserted, that are 
capable of inserting selenocysteine for the TGA codon contained in th translational region iocated upstream of the 
DNA are included in this invention. In addition, DNAs that hybridize under stringent conditions to a DNA comprising 
the nucleotide sequence set forth in SEQ ID NO: 1 , and that are capable of inserting selenocysteine for the TGA codon 
contained in the translational region located upstream of the DNA are also included in this invention. Accordingly, 
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SECIS homologues derived from species other than humans are included in the SECIS of this invention. 
[0011] Many of nucleotide sequences capable of hybridizing under stringent conditions to a specific sequence are 
thought to have similar activities to the functions borne by the specific sequence. An exemplary set of hybridization 
condition is as follows: , in 5x SSC at 25°C in the absence of formamide, preferably in 6x SSC, 40% formamide at 
5 25°C, more preferably in 5x SSC, 50% formamide at 40°C. Washing after hybridization is carried out, for example, in 
2x SSC at 37°C, preferably in Ix SSC at 55°C, and more preferably in Ix SSC at 60°C. 

[0012] Whether a DNA with a certain nucleotide sequence is capable of inserting selenocysteine for the TGA codon 
contained in the translational region located upstream of the DNA can be ascertained by examining the amino acid 
sequence of expression product of an expression vector inserted with the DNA downstream of the DNA encoding 
10 selenoprotein. Namely, the translation of the DNA into a protein with the expected molecular weight is considered as 
a proof that the insertion of selenocysteine is achieved. Alternatively, when selenocysteine constitutes the active center, 
the insertion thereof can be confirmed by examining the activity of the protein. TXRIls, and such, are used, for example, 
as selenoproteins. 

[0013] By ligating SECIS thus obtained downstream of a cDNA encoding a selenoprotein beforehand, the SECIS 
15 inserts selenocysteine for the UGA codon in the translation of the sequence. SECIS of this invention may be positioned 
at any position relative to the translational region. That is, SECIS can be ligated or inserted to any arbitrary sites of the 
3'UTR of the former. However, since one of advantages of this invention is its ability to shorten the whole cDNA including 
the 3'UTR, it is preferable not to leave unnecessary 3'UTR. Furthermore, the efficiency of selenocysteine uptake can 
be controlled by relocating the stem-Joop structure. For example, in the chimeric gene for type I tetraiodothyronine 
20 deiodinase, a 4-fold elevation of selenocysteine insertion activity has been observed, as compared with that by the ^ x 
original cDNA, by ligating the other SECIS of selenoprotein than the original one to the 3'UTR (Maria J. et al. Functional { } 
characterization of the eukaryotic SECIS elements which direct _selenocyste in e insertion at UGA codonsr EM BO J., 
- 12, 331 5-3322, 1 993). Based on these prior arts, one can locate the SECIS of this invention so as to achieve efficient 
insertion of selenocysteine. 

25 [0014] SECIS of the present invention can be incorporated beforehand, together with a regulatory region required 
for the protein expression , into an expression vector to form a selenoprotein expression vector. A vector expressing a 
selenoprotein in its complete form, containing selenocysteine, can be easily constructed by locating the cloning site 
upstream of SECIS in the vector and inserting cDNA encoding the selenoprotein amino acid sequence at this site. 
Prior reports have demonstrated that SECISs can function even if the gene located upstream thereof is replaced with 

30 a gene for other selenoprotein (Maria J. et al. Recognition of UGA as a selenocysteine codon in type I deiodinase 
requires sequences in the 3' untranslated region, Nature 353, 273-276, 1 991 ; Maria J. et al. Functional characterization 
of the eukaryotic SECIS elements which direct selenocysteine insertion at UGA codons. EMBO J., 12, 3315-3322, 
1993). As shown in the Examples, the SECIS of the present invention has the ability to insert selenocysteine not only 
into the TxRIla andTxRII0 from which it was derived but also into other selenoproteins. That is, SECIS of this invention 

35 manifest the activity necessary to insert selenocysteine in the expression of selenoproteins in host animal cells, re- 
gardless of the kind of the proteins. Herein, the "regulatory region" required for protein expression refers to promoters 
and enhancers. 

[001 5] In the bovine TxR, it has been demonstrated that selenocysteine bears an important role in its reducing activity. 
Since human TxR and TxRIls also have a structure supporting the reducing activity of bovine TxR, that is, an amino _ 
40 acid sequence comprising Cys-SeCys-Gly-termi nation codon, selenocysteine is thought to exert great effects on their f. _) 
enzyme activities as well. SECIS is a unique nucleotide sequence required for the translation of this selenocysteine, 
and an important element in the expression of these proteins. 

[0016] Furthermore, the present invention also provides DNAs capable of hybridizing to the DNA set forth in SEQ 
ID NO: 1 and having a chain length of 15 nucleotides or more. These DNAs according to this invention are useful as 

45 probes for SECIS detection or primers for SECIS amplification. For example, the detection of mRNA which hybridizes 
to SECIS-specific probes in certain tissues or cultured cells indicates a possibility for the strong expression of a gene 
encoding a protein having selenocysteine as its constituting amino acid in the cell. This further suggests the possibility 
of isolation and identification of unknown selenoproteins via DNAs hybridizing to SECIS of this invention. In the context 
of the present invention, analysis of mRNA can be performed by, for example, conventional technique such as Northern 

so blotting assay, for example. In addition, nucleotide sequences can be amplified using DNAs of this invention as primers 
with mRNA of SECIS or genomic DNA as the template. 

[0017] The design of nucleotide sequences to be used as probes and primers based on a given nucleotide sequence 
is routinely carried out by those skilled in the art. In the context of the present invention, the DNAs to be used for these 
purposes should have a chain Ingth f at least 15 nucleotides, so as to be able to hybridize to the target sequence 
55 under stringent conditions . Preferably, they are oligonucleotides comprising 15 to 200 nucleotides, more preferably 
25 to 100 nucleotides. Nucleotide sequences that achieve the specific hybridization need not be completely comple- 
mentary to the targeted nucleotide sequence. Variations in the sequence are acceptableinasmuch as these variants 
exhibit the required specificity under stringent conditions. Oligonucleotides having the predetermined nucleotide se- 
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quences can be obtained by chemical synthesis, for example. These oligonucleotides can be tagged with labeling 
components, including, but not limited to, enzymes, fluorescent substances, luminous substances and radioisotopes 
as the occasion demands. For attachment (immobilization) and such of the DNAs to a solid phase to separate them 
from these labeling substances or labeling probes that did not hybridize, direct chemical binding methods are known. 

s Alternatively, they can be indirectly bound to each other by.labeling oligonucleotides with ligands such as haptens or 
biotin, for example, and using antibodies recognizing these ligands and receptors for biotin, and so on. 
[0018] Furthermore, the present invention also provides DNAs that are anti-sense to the DNA set forth in SEQ ID 
NO: 1 , or portions thereof. The SECIS according to this invention bears an important role in selenoprotein expression, 
and its activity is supported by a stem-loop structure constructed in the mRNA (Fig. 4). Therefore, antisense sequences 

10 interfering with this structure will exert inhibitory action on the translation system of selenoprotein located upstream 
thereof. Since SECIS of this invention is derived from mRNAs of TxRIls, antisense sequences according to this inven- 
tion will function as expression inhibitors specific to TxRIls. 

[0019] In addition, the present invention also provides a method of screening for substances that regulate the ex- 
pression of selenoproteins. Namely, host cells transformed with the selenoprotein expression vector according to the 
15 present invention enable the screening of substances regulating expression of the proteins. On bringing these cells 
into contact with candidate expression regulating compounds, the TxR activity in the cells or culture supernatant thereof 
is increased or decreased depending on the expression regulating activity of the candidate compounds. Therefore, it 
is possible to screen regulating substances that inhibit or stimulate the expression by comparing changes in this activity 
with that of untreated controls. Mammalian cell lines such as 293T are preferable as host cells. 

20 

Brief Description of the Drawings 

[0020] ~Rg. l is a graphical representation of TxR activity, measured by the DTNB assay with recombinant TxRI or 
TxRIIot. Ordinate represents the absorbance at 412 nm, and abscissa the reaction time, (a) through (g) indicate ex- 
25 pression products of the following vectors : 



(a) pCMV6myc, 

(b) pCMV6myc-TxRlla, 

30 (c) pCMV6myc-TxRliaA3'UTR, 

(d) pCMV6myc-TxRlla3'UTR A.S., 

(e) pCMVemyc-TxRIIaCysAS'UTR sense, 

(f) pCMV6myc-TxRI-flag, and 

(g) pCMV6myc-TxRI-flag-3'UTR. 

35 

[0021] Fig. 2 is a graphical representation of TxR activity, measured by the insulin assay using recombinant TxRI or 
TxRIIa. Ordinate represents relative activity (%) to that of TxRIla taken as 1 00, and abscissa the types of vectors, (a) 
through (g) indicate expression products of the following vectors: 

40 (a) pCMV6myc, 

(b) pCMV6myc-TxRII<x, 

(c) pCMVemyc-TxRIIctAS'UTR, 

(d) pCMV6myc-TxRllo3'UTR A.S., 

(e) pCMVemyc-TxRIIaCysAS'UTR sense, 
« (f) pCMV6myc-TxRI-flag, and 

(g) pCMVSmyc-TxRI-flag-S'UTR. 

[0022] Fig. 3 is a series of schematic diagrams showing the structure each of vectors (b) through (g) expressed in 
Examples, whose 3'UTR is artificially modified. 
so [0023] Fig. 4 is a schematic diagram showing the stem-loop structure formed by the SECIS according to this invention 
in the mRNA transcribed from the nucleotide sequence set forth in SEQ ID NO: 1 . 

Best Mode for Carrying out the Invention 

55 [0024] The present invention will be described below in more detail with reference to examples. 

[0025] To demonstrate that the active center of TxRIls is the selenocysteine residue (SeCys) at the carboxyl terminal 
side, and that the 3'UTR thereof is essential for recognizing the UGA codon as the codon for SeCys, the following four 
types of TxRI la expression vectors have been constructed. In addition, to prove that the 3*UTR of TxRIIa is also useful 
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the vector using a fluorescence sequencer (PERKINELMER) based on Sanger method. As a result, plasmid 
D N ApC M V6myc-Tx R 1 1 otA3*UTR , which is a pCMV6myc vector inserted with the TxRIla gene sequence deprived of 
the 3'UTR, was obtained, 

3. Construction of pCMV6myc-TxRHaCysA3'UTR 

[0033] The gene sequence comprising TxRIla deprived of its 3'UTR, in which the codon for SeCys, its putative active 
center, was substituted with that for cysteine (Cys), was subcloned into mammalian cell expression vector pCMV6myc. 
PCR was carried out with the pACT2-TxRlla plasmid DNA as the template, using the following primers, to obtain the 
desired PCR product For the subsequent experiments, the 3'-primer was designed to yield cDNA encoding an amino 
acid sequence having the C-terminus amino acid sequence T-V-T-G-C-SeCys-G-stop deprived of its final stop codon 
and comprising T-V-T-G-C-Cys-G. 

5'-primer (pACT2 primer 4 (27mer), SEQ ID NO: 8); 

5'-TAC CCA TAC GAT GTT CCA GAT TAC GCT-3\ which is located upstream of MCS in the pACT2 vector. 
3'-primer (TxRllaCys-R1 (30mer), SEQ ID NO: 9); 

5'-ATA CTC GAG CCC ACA GCA GCC TGT CAC CGT-3', wherein the ^'-terminal three nucleotides (ATA) are 
for conducting treatment with restriction enzyme smoothly, and the 5* -terminal nucleotides at 4th to 9th positions 
(CTC GAG) are the restriction enzyme Xhol site. 2) PCR conditions: 

[0034] PCR was performed using a GeneAmp PCR System 2400 (PERKINELMER) according to the following pro- 
gram: _ _ _. _ ... = . . ' . „ . . • 

a) 94°C 5 min, 

b) 94°C 1 min, 57°C 1 min and 72°C 2 min, 20 cycles, and 

c) 72°C10min. 

3) Sequence confirmation by cloning and sequencing: 

[0035] PCR products were treated with the restriction enzymes BamHI and Xhol, fractionated by agarose electro- 
phoresis, and purified according to a standard method. The pCMV6myc vector was digested with the restriction en- 
zymes BamHI and Xhol, fractionated by agarose electrophoresis, and purified by a usual method. After ligating these 
digests, E. coti strain DH5a was transformed with the resulting recombinant vector. The plasmid DNA was purified by 
a standard method from the obtained colonies, and the PCR product in the vector was identified using a fluorescence 
sequencer (PERKINELMER), based on the Sanger method. As a result, the plasmid DNA, pCMV6myc- 
TxRllaCysA3'UTR, which is a pCMV6myc vector incorporated with the TxRIla gene sequence deprived of the 3'UTR 
and having the codon for SeCys replaced at the putative active center for the codon for cysteine (Cys), was obtained. 

Cloning of the 3'UTR in the human TxRIla 

[0036] The following primers were synthesized from the DNA sequence set forth in SEQ ID NO: 2 (Japanese Patent 
Application No. Hei 10-310422), and the 3'UTR sequence of TxRIla was obtained by PCR. The primers used were 
those that can amplify the 358 nucleotides at the 1573rd to the 1 930th positions in the sequence set forth in SEQ ID 
NO: 2. 

1) Preparation of primers: 
[0037] 

5'-primer (TxRllot3'UTR-F1 (30mer), SEQ ID NO: 10) 

5'-ATA TCT AGA TAA GCG CCA TCC CTG CAG GCC-3\ wherein the *5'-terminal three nucleotides (ATA) are for 
conducting treatment with restriction enzymes smoothly, and the 5'-terminal nucleotides at 4th to 9th positions 
(TCTAGA) are the restriction enzyme Xbal site. 
3' -primer (TxRlla3'UTR-R1 (30mer), SEQ ID NO: 11); 

5'-GCG TCT AGA CAC ACT TCA GAA AAA GTA CCC-3', wherein the *5'-terminal three nucleotides (GCG) are 
for conducting treatment with restriction enzymes smoothly, and the 5'-terminal nucleotides at 4th to 9th positions 
(TCTAGA) are the restriction enzyme Xbal site. 
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2) PCR: 

[0038] Using the plasmid pACT2-TxRlla, inserted with a full-length cDNA containing the 3'UTR of TxRIla, as the 
template DNA, the 3'UTR was amplified by PCR. PCR was carried out using a GeneAmp PCR System 2400 (PERK- 
5 INELMER) according to the following program: 

a) 94°C 5 min, 

b) 94°C 1 min, 58°C 3 min and 72°C 1 min, 1 cycle, 

c) 94°C 1 min, 66°C 0.5 min and 72°C 0.5 min, 20 cycles, and 
io d)72°C10min. 

3) Cloning of PCR product into pSL-1180 vector: 

[0039] After PCR, the amplified DNA fragment was identified by electrophoresis in 1% agarose, and treated with the 
15 restriction enzyme Xbal. After the DNA fragment treated with the restriction enzyme was electrophoresed in 1% aga- 
rose, it was excised from the gel, and purified by the glass matrix method (BIQ1 01 , GeneClean). pSL-11 80 (Amersham 
Pharmacia Biotech) was treated with the restriction enzyme Xbal, and then purified by the alkaline phosphatase treat- 
ment. The Xbal-treated vector and DNA fragment were ligated by a usual method. The E. coli strain DH5a was trans- 
formed with the resulting plasmid according to a standard method, and the plasmid DNA was purified from colonies 
20 thus obtained using the polyethylene glycol precipitation method. The PGR product in the vector was identified by a 
fluorescence sequencer (PERKINELMER), based on Sanger method. Thus, a plasmid DNA, pSL-3'UTR, namely, the 
pSL-1180 vector incorporated with the 3'UTR sequence of .human TxRIla gene, was obtained. _ ^= = 

4. Construction of pCMV6myc-TxRlla3'UTR A.S. 

25 

[0040] pSL-3'UTR was treated with the restriction enzyme Xbal to excise the 3'UTR of TxRIla. 
[0041] pCMV6myc-TxRllaA3'UTR was digested with the restriction enzyme Xbal followed by the treatment with al- 
kaline phosphatase. These two digests were ligated according to a standard method, and E coli strain DH5a was 
transformed with the resulting plasmid. The plasmid DNA was purified from colonies thus obtained using the polyeth- 
30 ylene glycol precipitation method, and the PCR product within the vector was identified by a fluorescence sequencer 
(PERKINELMER) , based on Sanger method. As a result, the pCMV6myc-TxRlla3'UTR A.S., wherein the 3'UTR of 
TxRIla is incorporated in the reverse (antisense) direction downstream of TxRlloA3'UTR, was obtained. 

5. Construction of pCMV6myc-TxRUaCys3'UTRsense and A.S. 

35 

[0042] pSL-3'UTR was digested with the restriction enzyme Xbal to excise the 3'UTR of TxRIla. 
[0043] pCMV6myc-TxRllaCysA3'UTR was digested with the restriction enzyme Xbal followed by the treatment with 
alkaline phosphatase. These two digests were ligated according to a usual method, and the E. coli strain DH5a was 
transformed with the resulting plasmid. The plasmid DNA was purified from colonies thus obtained using the polyeth- 
40 ylene glycol precipitation method, and the PCR product within the vector was identified by a fluorescence sequencer 
(PERKINELMER), based on Sanger method. As a result, a pCMVSmyc-TxRIIaCysS'UTRsense, wherein the 3'UTR of 
TxRIla is incorporated in the normal (sense) direction downstream of TxRllaCysA3'UTR, was obtained. 

Cloning of human thioredoxin reductase I 

45 

1 . Construction of pCMV6myc-TxRI-flag 
1) Preparation of primers: 

so [0044] PCR primers described below were synthesized from the previously reported DNA sequence (GenBank ac- 
cession number D88687) of human thioredoxin reductase I (hereinafter abbreviated as TxRI). For the subsequent 
experiments, a 3'-primer was designed to yield cDNA encoding an amino acid sequence comprising the C-terminal 
amino acid sequence, L-Q-A-G-C-SeCys-G-stop, from which the final stop codon is removed. 

55 5'-primer(TxRI-F1 (30mer), SEQ !D NO: 12) 

5'-ATA GGA TCC ATG TCA TGT GAG GAC GGT CGG-3', wherein the *5'-terminal three nucleotides (ATA) are 
for conducting treatment with restriction enzymes smoothly, and the 5'-terminal nucleotides at 4th to 9th positions 
(GGATCC) are the restriction enzyme BamHI site. 
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3*-primer (TxRI-R1 (30mer), SEQ ID NO: 13); 

5*-ATA CTC GAG ACC TCA GCA GCC AGC CTG GAG-3', wherein the *5'-terminal three nucleotides (ATA) are 
for conducting treatment with restriction enzymes smoothly, and the S'-terminal nucleotides at 4th to 9th positions 
(CTCGAG) are the restriction enzyme Xhol site. 

5 

2) PCR: 

[0045] A full length human TxRI gene was amplified by PCR using a plasmid DNA purified from the Human Placenta 
MATCHMAKER cDNA library purchased from CLONTECH as the template DNA. PCR was performed using a Gene- 
10 Amp PCR System 2400 (PERKINELMER) according to the following program: 

a) 94°C 5 min, 

b) 94°C 1 min, 56°C 3 min and 72°C 2 min, 1 cycle, 

c) 94°C 1 min, 65°C 1 min and 72°C 2 min, 35 cycles, and 
15 d)72°C10min. 

3) Cloning of PCR product into pCMV6myc-flag vector: 
3)-1 Purification of PCR product 

20 

[0046] After PCR, the amplified DNA fragment was identified by electrophoresis in 1% agarose, and then digested 
with the restriction enzymes BamHI and Xhol. The restriction. enzyme-treated DNA fragment was electrophoresed in 
1% agarose, excised from the gel, and purified by the glass matrix method (BI0101, GeneClean). 

25 3)-2 pCMV6myc-flag vector 

[0047] The pCMV6myc-flag vector has the pcDNA3 (Invitrogen) as the basic skeleton, which is incorporated with a 
DNA encoding six repeated amino acid sequences, referred to as the Myc tagged epitope, at the Hindi Jl and BamHI 
sites of its multicloning site (MCS), and also with another DNA encoding an amino acid sequence, designated as the 
30 flag epitope, at the Xhol and Xbal sites downstream of the Myc tag gene. Therefore, a gene incorporated between the 
Myc tag and flag tag genes would express a gene product ligated to the Myc tag at its N-terminus and the flag tag at 
its C-terminus. 

[0048] Since products of genes incorporated into MCS can be detected with antibodies to these Myc and flag tags 
even when no antibody to the product is available, these tags are widely used in a variety of expression systems. 

35 

3)-3 Ligation and transformation of E. coli 

[0049] The PCR product which had been cleaved with the restriction enzymes BamHI and Xhol and then purified 
were ligated to the pCMV6myc-flag vector, which had been cleaved with the same restriction enzymes and purified 

*o according to a usual method. The E. constrain DH5ct was transformed with the above plasmid according to the standard 
method (Hanahan, D. (1983) Studies on transformation of Escherichia coli with plasmids, J. Mol. Biol. 166: 557), and 
the plasmid DNA was recovered from colonies thus obtained using the alkaline SDS method. Plasmid DNA thus col- 
lected was cleaved with appropriate restriction enzymes, and the incorporation of desired PCR product into the vector 
was confirmed by agarose electrophoresis. Recovered DNA was purified by the polyethylene glycol precipitation meth- 

45 od, and PCR product in the vector was identified using a fluorescence sequencer (PERKINELMER) based on Sanger 
method. Thus, a plasmid DNA, pCMV6myc-TxRI-flag, that is, the pCMV6myc-flag vector incorporated with a full length 
human TxRI gene, was obtained. 

2. Construction of pCMV6myc-TxRI-flag-3'UTR 

50 

[0050] pSL-3'UTR was treated with the restriction enzyme Xbal to excise the 3'UTR of TxRIIot. 
[0051] pCMV6myc-TxRI-flag was treated with the restriction enzyme Xbal, followed by the treatment with alkaline 
phosphatase. These digests were ligated by a standard method. E. coli strain DH5a was transformed with the above 
plasmid according to a standard method. The plasmid DNA was recovered from colonies thus obtained and purified 
55 by the polyethylene glycol precipitation method. The PCR product within the vector was identified using a fluorescence 
sequencer (PERKINELMER) based on Sanger method. Thus, the pCMV6myc-TxRI-flag-3'UTR, in which the 3'UTR 
of TxRIla was incorporated downstream of TxRI gene in the normal (sense) direction was obtained. 
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Purification and activity assay of recombinant TxRI and TxRIIcc proteins 
1) Purification of Myc-tagged fusion proteins between TxRI and TxRI la: 
5 [0052] 

(a) pCMV6myc, 

(b) pCMV6myc-TxRlla, 

(c) pCMV6myc-TxRllaA3'UTR, 

10 (d) pCMVemyc-TxRIIaS'UTR A.S., 

(e) pCMV6myc-TxRllaCysA3'UTR sense, 
(0 pCMV6myc-TxRI-flag, and 
(g) pCMV6myc-TxRI-flag-3'UTR. 

is [0053] Cultured cell 293T, derived from the human fetal kidney cells, were transformed with the mammalian cell 
expression plasmids (a) to (g) by the standard lipofection method. Cells were recovered 48 h after the transformation 
to prepare cell extracts, to which the anti-Myc monoclonal antibody bound to protein A sepharose was added, and the 
resulting mixture was gently stirred at 4°C for 2 h. The Myc-tagged fusion protein bound to the anti-Myc monoclonal 
antibody immobilized to protein A sepharose was precipitated by centrifugation, and, after the removal of supernatant, 

20 washed several times with NETN buffer (1 G mM Tris-HCi, 1 mM EDTA, 0,5% N P-40 and 1 50 mfvTNaCi). The supernatant ^ 
was completely removed, and diluted in 1 00 mM potassium phosphate, pH 7.0, containing 1 0 mM EDTA and 0.2 mg/ \£y 
ml bovine serum albumin. . . 



25 



30 



2) Activity assay 

[0054] Activity assay of thioredoxin reductase was performed by the following two methods according to the standard 
technique (Holmgren, A. et al. Methods Enzymol. 252: 199, 1995). 

Method 1 . 5,5'-Dithiobis(2-nitrobenzoic acid) (DTNB) assay 

[0055] In the DTNB assay, the TxR activity to produce TNB from DTNB is measured by the thiol absorbance at 412 
nm based on the following formula: 



35 DTNB + NADPH + H + -> 2TNB + NADP + 

[0056] Assay buffer 1 : 

[0057] 100 mM potassium phosphate, pH 7.0, containing 10 mM EDTA, 0.25 mM NADPH, 0.2 mg/ml bovine serum 
albumin (BSA), 1% ethanol, and 1 mM DTNB. 
40 [0058] Purified Myc-tagged fusion protein (1 to 50 pj) was added to the assay buffer 1 to make a total volume 1 .0 f 
ml. The absorbance at 412 nm of the assay mixture was followed at 25°C for 20 min. Results are shown in Fig. 1 . 

Method 2. Insulin assay 

45 [0059] Assay buffer 2: 

[0060] 50 mM phosphate buffer pH 7.0 containing 20 mM EDTA, 80 mM insulin, 0.25 mM NADPH, 16 mM £ coli 
thioredoxin-S2. 

[0061] Purified Myc-tagged fusion protein (1 to 50 uJ) was added to the assay buffer 2 to make a total volume 1 .0 
ml. The oxidation of NADPH was measured at 30°C for 5 min as the absorbance at 340 nm. Trx is reduced by the TxR 
so activity, and the reduced Trx further promotes the reduction of insulin. In this case, TxR activity can be measured from 
the amount of NADPH to be oxidized. The amount of NADPH which was oxidized was calculated from the following 
equation. Relative values of expression products due to each of vectors were expressed as percentage of the TxRI la 
activity taken as 100%, and were graphically represented (Fig. 2). 

55 

AA340 X 0.5 / 6.2 
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Results 

[0062] No activity was observed when the 3'UTR of TxRIIct was omitted (c), and also when the 3'UTR of TxRIla was 
inserted in the antisense direction (d). No activity was also observed when SeCys was substituted with Cys (e) regard- 
less of the absence or presence of its 3'UTR. Likewise, in the case where the 3'UTR was missing from TxRI (f) , no 
activity was detected. However, when the 3'UTR of TxRIIct was added to TxRI, the activity of TxRI was observed (g) 
[0063] As these experiments demonstrate, the fact that the enzyme activity was lost, when the SeCys residue at the 
C-terminal side of TxRIla was substituted with Cys, strongly suggests that the active center of TxRIla is the SeCys 
residue. By the loss of enzyme activity due to deletion of its 3'UTR, it could be confirmed that the 3'UTR is essential 
for recognizing the UGA codon as that for SeCys. Furthermore, the fact that TxR activity was gained by adding the 
3'UTR of TxRIla to other selenoprotein TxRI demonstrates that the 3'UTR of TxRIla is useful for incorporating the 
SeCys residue into other selenoproteins. 

Industrial Applicability 

[0064] The SECIS of this invention allows for the translation of selenoproteins without fail while having a sequence 
of less than 360 b. This size (360b) is extremely small as compared with known SECISs, comprising, for example, 
several thousands nucleotides. Nucleotide sequences of the size of the present invention will not exert unfavorable 
effects under usual conditions when inserted into vectors. Furthermore, such a SECIS itself is easily synthesized. 
[0065] SECIS of this invention comprise a sequence having a broad activity universaiiy acting on not only TxRIls 
from which it was derived but also overall selenoproteins. 
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SEQUENCE LISTING 
<110> Medical & Biological Laboratories Co. , Ltd. 

<120> Selenocystein Insertion Sequence 

<130> M3-010PCT 



<140> 
<141> 

<150> JP 1998-310422 
<15I> 1998-10-30 

<150> JP 1998-325344 
<151> 1998-11-16 

<160> 13 

<170> Patentln Ver. 2.0 
<210> 1 
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<211> 358 
<212> DNA 

<213> Homo sapiens 
<400> 1 

taagcgccat ccctgcaggc cagggcacac ggtgcgcccg ccgccagctc ctcggaggcc 60 
agacccagga tggctgcagg ccaggtttgg ggggcctcaa ccctctcctg gagcgcctgt 120 
gagatggtca gcgtggagcg caagtgctgg acgggtggcc cgtgtgcccc acagggatgg 180 
ctcaggggac tgtccacctc acccctgcac ctttcagcct ttgccgccgg gcaccccccc 240 
caggctcctg gtgccggatg atgacgacct gggtggaaac ctaccctgtg ggcacccatg 300 
tccgagcccc ctggcatttc tgcaatgcaa ataaagaggg tactttttct gaagtgtg 358 

<210> 2 

<2U> 1959 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (10).. (1S72) 
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<220> 

<221> 3'UTR 

<222> (1573).. (1930) 

<220> 

<221> misc_structure 
<222> (1567).. (1569) 

<223> tga is translated to selenosystein, shown by Xaa. 

<400> 2 . — - - 

atggccgca atg gcg gtg gcg ctg egg gga tta gga ggg cgc ttc egg tgg 51 
Met Ala Val Ala Leu Arg Gly Leu Gly Gly Arg Phe Arg Trp 
1 5 10 

egg acg cag gec gtg gcg ggc ggg gtg egg ggc gcg gcg egg ggc gca 99 
Arg Thr Gin Ala Val Ala Gly Gly Val Arg Gly Ala Ala Arg Gly Ala 
15 20 25 30 



gca gca ggt cag egg gac tat gat etc ctg gtg gtc ggc ggg gga tct 147 
Ala Ala Gly Gin Arg Asp Tyr Asp Leu Leu Val Val Gly Gly Gly Ser 
35 40 45 



ggt ggc ctg get tgt gec aag gag gec gee cag ctg gga agg aag gtg 195 
Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly Arg Lys Val 
50 55 60 
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40 



45 



50 



55 



gcc gtg gtg gac tac gtg gaa cct tct ccc caa ggc acc egg tgg ggc 243 
Ala Val Val Asp Tyr Val Glu Pro Ser Pro Gin Gly Thr Arg Trp Gly 
65 70 75 

etc ggc ggc acc tgc gtc aac gtg ggc tgc ate ccc aag aag ctg atg 291 
Leu Gly Gly Thr Cys Val Asn Val Gly Cys He Pro Lys Lys Leu Met 
80 85 90 



cac cag gcg gca ctg ctg gga ggc ctg ate caa gat gcc ccc aac tat 339 
His Gin Ala Ala Leu Leu Gly Gly Leu He Gin Asp Ala Pro Asn Tyr 
25 95 100 105 110 



ggc tgg gag gtg gcc cag ccc gtg ccg cat gac tgg agg aag atg gca 387 
Gly Trp Glu Val Ala Gin Pro Val Pro His Asp Trp Arg Lys Met Ala 
115 120 125 

gaa get gtt caa aat cac gtg aaa tec ttg aac tgg ggc cac cgt gtc 435 
Glu Ala Val Gin Asn His Val Lys Ser Leu Asn Trp Gly His Arg Val 
130 135 140 

cag ctt cag gac aga aaa gtc aag tac ttt aac ate aaa gcc age ttt 483 
« 

Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn lie Lys Ala Ser Phe 
145 150 155 

gtt gac gag cac acg gtt tgc ggc gtt gcc aaa ggt ggg aaa gag att 531 
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Val Asp Glu His Thr Val Cys Gly Val Ala Lys Gly Gly Lys Glu He 
160 165 170 

ctg ctg tea gec gat cac ate ate att get act gga ggg egg ccg aga 579 
Leu Leu Ser Ala Asp His lie He He Ala Thr Gly Gly Arg Pro Arg 
175 180 185 190 

tac ccc acg cac ate gaa ggt gec ttg gaa tat gga ate aca agt gat 627 
Tyr Pro Thr His He Glu Gly Ala Leu Glu Tyr Gly He Thr Ser Asp 
195 200 205 



gac ate ttc tgg ctg aag gaa tec cct gga aaa acg ttg gtg gtc ggg 675 
Asp He Phe Trp Leu Lys Glu Ser Pro Gly Lys Thr Leu Val Val Gly 
30 210 215 220 



gec age tat gtg gee ctg gag tgt get ggc ttc etc acc ggg att ggg 723 
Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe Leu Thr Gly He Gly 
225 230 235 



ctg gac acc acc ate atg atg cgc age ate ccc etc cgc ggc ttc gac 771 
45 Leu Asp Thr Thr He Met Met Arg Ser lie Pro Leu Arg Gly Phe Asp 

240 245 250 



55 



cag caa atg tec tec atg gtc ata gag cac atg gca tct cat ggc acc 819 
Gin Gin Met Ser Ser Met Val He Glu His Met Ala Ser His Gly Thr 
255 260 265 270 
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15 



20 



25 



30 



35 



£"V'') 40 



45 



50 



55 



egg ttc ctg agg ggc tgt gec ccc teg egg gtc agg agg etc cc*t gat 867 
Arg Phe Leu Arg Gly Cys Ala Pro Ser Arg Val Arg Arg Leu Pro Asp 
275 280 285 

ggc cag ctg cag gtc acc tgg gag gac age acc acc ggc aag gag gac 915 
Gly Gin Leu Gin Val Thr Trp Glu Asp Ser Thr Thr Gly Lys Glu Asp 
290 295 300 

acg ggc acc ttt gac acc gtc ctg tgg gec ata ggt cga gtc cca gac 963 
Thr Gly Thr Phe Asp Thr Val Leu Trp Ala He Gly Arg Val Pro Asp 
305 310 315 

acc aga agt ctg aat ttg gag aag get ggg gta gat act age ccc gac 1011 
Thr Arg Ser Leu Asn Leu Glu Lys Ala Gly Val Asp Thr Ser Pro Asp 
320 325 330 

act cag aag ate ctg gtg gac tec egg gaa gec acc tct gtg ccc cac 1059 
Thr Gin Lys He Leu Val Asp Ser Arg Glu Ala Thr Ser Val Pro His 
335 340 345 350 

ate tac gec att ggt gac gtg gtg gag ggg egg cct gag ctg aca ccc 1107 
lie Tyr Ala He Gly Asp Val Val Glu Gly Arg Pro Glu Leu Thr Pro 
355 360 365 

aca gcg ate atg gec ggg agg etc ctg gtg cag egg etc ttc ggc ggg 1155 
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Thr Ala lie Met Ala Gly Arg Leu Leu Val Gin Arg Leu Phe Gly Gly 
370 375 380 

tec tea gat ctg atg gac tac gac aat gtt ccc acg acc gtc ttc acc 1203 
Ser Ser Asp Leu Met Asp Tyr Asp Asn Val Pro Thr Thr Val Phe Thr 
385 390 395 

cca ctg gag tat ggc tgt gtg ggg ctg tec gag gag gag gca gtg get 1251 
Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu Glu Glu Ala Val Ala 
400 405 410 



cgc cac ggg cag gag cat gtt gag gtc tat cac gee cat tat aaa cca 1299 
Arg His Gly Gin Glu His Val Glu Val Tyr His Ala His Tyr Lys Pro 
30 415 420 425 430 



ctg gag ttc acg gtg get gga cga gat gca tec cag tgt tat gta aag 1347 
Leu Glu Phe Thr Val Ala Gly Arg Asp Ala Ser Gin Cys Tyr Val Lys 
435 440 445 



&tg gtg tgc ctg agg gag ccc cca cag ctg gtg ctg ggc ctg cat ttc 1395 
45 Met Val Cys Leu Arg Glu Pro Pro Gin Leu Val Leu Gly Leu His Phe 

450 455 460 



55 



ctt ggc ccc aac gca ggc gaa gtt act caa gga ttt get ctg ggg ate 1443 
Leu Gly Pro Asn Ala Gly Glu Val Thr Gin Gly Phe Ala Leu Gly He 
465 470 475 
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aag tgt ggg get tec tat gcg cag gtg atg egg ace gtg ggt ate cat 1491 
Lys Cys Gly Ala Ser Tyr Ala Gin Val Met Arg Thr Val Gly He His 
480 485 490 

ccc aca tgc tct gag gag gta gtc aag ctg cgc ate tec aag cgc tea 1539 
Pro Thr Cys Ser Glu Glu Val Val Lys Leu Arg He Ser Lys Arg Ser 
495 500 505 510 

ggc ctg gac ccc acg gtg aca ggc tgc tga ggg taagegecat ccctgcaggc 1592_ 
Gly Leu Asp Pro Thr Val Thr Gly Cys Xaa Gly 
515 520 

cagggcacac ggtgcgcccg ccgccagctc ctcggaggcc agacccagga tggctgeagg 1652 

ccaggtttgg ggggcctcaa ccctctcctg gagegectgt gagatggtca gcgtggagcg 1712 

caagtgctgg acgggtggec cgtgtgcccc acagggatgg ctcaggggac tgtccacctc 1772 

acccctgcac ctttcagcct ttgccgccgg gcaccccccc caggctcctg gtgccggatg 1832 

atgacgacct gggtggaaac ctaccctgtg ggcacccatg tccgagcccc ctggcatttc 1892 

tgcaatgcaa ataaagaggg tactttttct gaagtgtgta aaaaaaaaaa aaaaaaaaaa 1952 

aaaaaaa 1959 
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<210> 3 
<211> 519 
<212> PRT 

<213> Homo sapiens 

<223> Xaa(520) means selenosysteine. 

<400> 3 

Met Ala Val Ala Leu Arg Gly Leu Gly Gly Arg Phe Arg Trp Arg Thr 
1 5 10 15 

Gin Ala Val Ala Gly Gly Val Arg Gly Ala Ala Arg Gly Ala Ala Ala 
20 25 30 

Gly Gin Arg Asp Tyr Asp Leu Leu Val Val Gly Gly Gly Ser Gly Gly 
35 40 45 

Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly Arg Lys Val Ala Val 
50 55 60 

Val Asp Tyr Val Glu Pro Ser Pro Gin Gly Thr Arg Trp Gly Leu Gly 
65 70 75 80 

Gly Thr Cys Val Asn Val Gly Cys lie Pro Lys Lys Leu Met His Gin 
85 90 95 
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Ala Ala Leu Leu Gly Gly Leu lie Gin Asp Ala Pro Asn Tyr Gly Trp 
100 105 110 

Glu Val Ala Gin Pro Val Pro His Asp Trp Arg Lys Met Ala Glu Ala 
115 120 L25 

Val Gin Asn His Val Lys Ser Leu Asn Trp Gly His Arg Val Gin Leu 
130 135 140 

Gin Asp Ar g Lys Val Lys Tyr Phe Asn lie Lys Ala Ser Phe Val Asp 
145 150 155 160 

Glu His Thr Val Cys Gly Val Ala Lys Gly Gly Lys Glu lie Leu Leu 
165 170 175 

Ser Ala Asp His lie He He Ala Thr Gly Gly Arg Pro Arg Tyr Pro 
180 185 190 

Thr His He Glu Gly Ala Leu Glu Tyr Gly He Thr Ser Asp Asp He 
195 200 205 

Phe Trp Leu Lys Glu Ser Pro Gly Lys Thr Leu Val Val Gly Ala Ser 
210 215 220 

Tyr Val Ala Leu Glu Cys Ala Gly Phe Leu Thr Gly He Gly Leu Asp 
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225 230 235 240 

Thr Thr He Met Met Arg Ser He Pro Leu Arg Gly Phe Asp Gin Gin 
245 250 255 

Met Ser Ser Met Val He Glu His Met Ala Ser His Gly Thr Arg Phe 
260 265 270 

Leu Arg Gly Cys Ala Pro Ser Arg Val Arg Arg Leu Pro Asp Gly Gin 
275 280 285 .. 

Leu Gin Val Thr Trp Glu Asp Ser Thr Thr Gly Lys Glu Asp Thr Gly 
290 295 300 

Thr Phe Asp Thr Val Leu Trp Ala He Gly Arg Val Pro Asp Thr Arg 
305 310 315 320 

Ser Leu Asn Leu Glu Lys Ala Gly Val Asp Thr Ser Pro Asp Thr Gin 
325 330 335 

Lys He Leu Val Asp Ser Arg Glu Ala Thr Ser Val Pro His He Tyr 
340 345 350 

Ala He Gly Asp Val Val Glu Gly Arg Pro Glu Leu Thr Pro Thr Ala 
355 360 365 
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He Met Ala Gly Arg Leu Leu Val Gin Arg Leu Phe Gly Gly Ser Ser 
370 375 380 

Asp Leu Met Asp Tyr Asp Asn Val Pro Thr Thr Val Phe -Thr Pro Leu 
385 390 395 400 

Glu Tyr Gly Cys Val Gly Leu Ser Glu Glu Glu Ala Val Ala Arg His 
405 410 415 

Gly Gin Glu His Val Glu Val Tyr His Ala His Tyr Lys Pro Leu Glu 
420 425 430 

Phe Thr Val Ala Gly Arg Asp Ala Ser Gin Cys Tyr Val Lys Met Val 
435 440 445 

Cys Leu Arg Glu Pro Pro Gin Leu Val Leu Gly Leu His Phe Leu Gly 
450 455 460 

Pro Asn Ala Gly Glu Val Thr Gin Gly Phe Ala Leu Gly He Lys Cys 
465 470 475 480 

Gly Ala Ser Jyr Ala Gin Val Met Arg Thr Val Gly He His Pro Thr 
'485 490 495 

Cys Ser Glu Glu Val Val Lys Leu Arg He Ser Lys Arg Ser Gly Leu 
500 505 510 
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Asp Pro Thr Val Thr Gly Cys Xaa Gly 
515 520 

<210> 4 
<211> 2056 
<212> DNA 

<213> Homo sapiens 

- <220> — - 

<221> CDS 

<222> (188).. (1669) 
<220> 

<221> 3' UTR 

<222> (1670).. (2027) 

<220> 

<221> misc_structure 
<222> (1664).. (1666) 

<223> tga is translated to selenosystein, shown by Xaa. 

<400> 4 

gtcccggacc tcaggcccag ttcagtgtac ttcccctctc tacttcctcc ctccagt 
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ttctccatcc ctcccttttt tggctgcccc ttgcctgcct tcctcgccag. tagcttgcag 120 

agtagacacg atgacacctt ttgcaggcta aaaaggctga gagtggcact atgtgcagtg L80 

agccacc atg gag gac caa gca ggt cag egg gac tat gat etc ctg gtg 229 
Met Glu Asp Gin Ala Gly Gin Arg Asp Tyr Asp Leu Leu Val 
1 5 10 

gtc ggc ggg gga tct ggt ggc ctg get tgt gec aag gag gec gec cag 277 
Val Gly Gly Gly Ser Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin 
15 20 25 30 

ctg gga agg aag gtg gec gtg gtg gac tac gtg gaa cct tct ccc caa 32S 
Leu Gly Arg Lys Val Ala Val Val Asp Tyr Val Glu Pro Ser Pro Gin 
35 40 45 

ggc acc egg tgg ggc etc ggc ggc acc tgc gtc aac gtg ggc tgc ate 373 
Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys He 
50 55 60 

ccc aag aag ctg atg cac cag gcg gca ctg ctg gga ggc ctg ate caa 421 
Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gly Leu He Gin 
65 70 75 

gat gec ccc aac tat ggc tgg gag gtg gec cag ccc gtg ccg cat gac 469 
Asp Ala Pro Asn Tyr Gly Trp Glu Val Ala Gin Pro Val Pro His Asp 
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80 85 90 

tgg agg aag £tg gca gaa get gtt caa aat cac gtg aaa tec ttg aac 517 
Trp Arg Lys Met Ala Glu Ala Val Gin Asn His Val Lys Ser Leu Asn 
95 100 105 110 

tgg ggc cac cgt gtc cag ctt cag gac aga aaa gtc aag tac ttt aac 565 
Trp Gly His Arg Val Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn 
115 120 125 



ate aaa gec age ttt gtt gac gag cac acg gtt tgc ggc gtt gec aaa 613 
lie Lys Ala Ser Phe Val Asp Glu His Thr Val Cys Gly Val Ala Lys 
130 135 140 

ggt ggg aaa gag att ctg ctg tea gec gat cac ate ate att get act 661 
Gly Gly Lys Glu lie Leu Leu Ser Ala Asp His He lie He Ala Thr 
145 150 155 



40 

gga ggg egg ccg aga tac ccc acg cac ate gaa ggt gec ttg gaa tat 709 
Gly Gly Arg Pro Arg Tyr Pro Thr His lie Glu Gly Ala Leu Glu Tyr 
« 160 165 170 



gga ate aca agt gat gac ate ttc tgg ctg aag gaa tec cct gga aaa 757 
Gly lie Thr Ser Asp Asp lie Phe Trp Leu Lys Glu Ser Pro Gly Lys 
175 180 185 190 



v 
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acg ttg gtg gtc ggg gcc age tat gtg gec ctg gag tgt get ggc ttc 805 
Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Giy Phe 
195 200 205 

etc acc ggg att ggg ctg gac acc acc ate atg atg cgc age ate ccc 853 
Leu Thr Gly He Gly Leu Asp Thr Thr lie Met Met Arg Ser He Pro 
210 215 220 
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50 
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etc cgc ggc ttc gac cag caa atg tec tec atg gtc ata gag cac atg 901 
Leu Arg Gly Phe Asp Gin Gin Met Ser Ser Met Val He Glu His Met 
225 230 235 



gca tct cat ggc acc egg ttc ctg agg ggc tgt gcc ccc teg egg gtc 949 
30 Ala Ser His Gly Thr Arg Phe Leu Arg Gly Cys Ala Pro Ser Arg Val 

240 245 250 



agg agg etc cct gat ggc cag ctg cag gtc acc tgg gag gac age acc 997 
Arg Arg Leu Pro Asp Gly Glh Leu Gin Val Thr Trp Glu Asp Ser Thr 
255 260 265 270 



45 acc ggc aag gag gac acg ggc acc ttt gac acc gtc ctg tgg gcc ata 1045 

Thr Gly Lys Glu Asp Thr Gly Thr Phe Asp Thr Val Leu Trp Ala He 
275 280 285 



ggt cga gtc cca gac acc aga agt ctg aat ttg gag aag get ggg gta 1093 
Gly Arg Val Pro Asp Thr Arg Ser Leu Asn Leu Glu Lys Ala Gly Val 



27 



EP1 126 026 A1 



10 



is 



20 



25 



30 



35 



50 



290 295 300 

gat act age ccc gac act cag aag ate ctg gtg gac tec egg gaa gec 1141 
Asp Thr Ser Pro Asp Thr Gin Lys He Leu Val Asp Set Arg Glu Ala 
305 310 315 

acc tct gtg ccc cac ate tac gee att ggt gac gtg gtg gag ggg egg 1189 
Thr Ser Val Pro His He Tyr Ala He Gly Asp Val Val Glu Gly Arg 
320 325 330 

cct gag ctg aca ccc aca gcg ate atg gec ggg agg etc ctg gtg cag 1237 
Pro Glu Leu Thr Pro Thr Ala He Met Ala Gly Arg Leu Leu Val Gin 
335 340 345 350 

egg etc ttc ggc ggg tec tea gat ctg atg gac tac gac aat gtt ccc 1285 
Arg Leu Phe Gly Gly Ser Ser Asp Leu Met Asp Tyr Asp Asn Val Pro 
355 360 365 



40 ^ 

acg acc gtc ttc acc cca ctg gag tat ggc tgt gtg ggg ctg tec gag 1333 * 
Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu 
45 370 375 380 



gag gag gca gtg get cgc cac ggg cag gag cat gtt gag gtc tat cac 1381 
Glu Glu Ala Val Ala Arg His Gly Gin Glu His Val Glu Val Tyr His 
385 390 395 



55 
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gcc cat tat aaa cca ctg gag ttc acg gtg get gga cga gat gca tec 1429 
Ala His Tyr Lys Pro Leu Glu Phe Thr Val Ala Gly Arg Asp Ala Ser 
400 405 410 

cag tgt tat gta aag atg gtg tgc ctg agg gag ccc cca cag ctg gtg 1477 
Gin Cys Tyr Val Lys Met Val Cys Leu Arg Glu Pro Pro Gin Leu Val 
415 420 425 430 

ctg ggc ctg cat ttc ctt ggc ccc aac gca ggc gaa gtt act caa gga 1525 
Leu Gly Leu His Phe Leu Gly Pro Asn Ala Gly Glu Val Thr Gin Gly 
435 440 445 

ttt get ctg ggg ate aag tgt ggg get tec tat gcg cag gtg atg egg 1573 
Phe Ala Leu Gly lie Lys Cys Gly Ala Ser Tyr Ala Gin Val Met Arg 
450 455 460 

ace gtg ggt ate cat ccc aca tgc tct gag gag gta gtc aag ctg cgc 1621 
Thr Val Gly lie His Pro Thr Cys Ser Glu Glu Val Val Lys Leu Arg 
465 470 475 

ate tec aag cgc tea ggc ctg gac ccc acg gtg aca ggc tgc tga ggg 1669 
He Ser Lys Arg Ser Gly Leu Asp Pro Thr Val Thr Gly Cys Xaa Gly 
480 485 490 

taagegecat ccctgcaggc cagggcacac ggtgcgcccg ccgccagctc ctcggaggcc 1729 
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agacccagga tggctgcagg ccaggtttgg ggggcctcaa ccctctcctg- gagcgcctgt 1789 
gagatggtca gcgtggagcg caagtgctgg acgggtggcc cgtgtgcccc acagggatgg 1849 
ctcaggggac tgtccacctc acccctgcac ctttcagcct ttgccgccgg gcaccccccc 1909 
caggctcctg gtgccggatg atgacgacct gggtggaaac ctaccctgtg ggcacccatg 1969 
tccgagcccc ctggcatttc tgcaatgcaa ataaagaggg tactttttct gaagtgtgta 2029 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 2056 

<210> 5 

<211> 492 

<212> PRT 

<213> Homo sapiens 

<223> Xaa(493) means selenosysteine. 

<400> 5 

Met Glu Asp Gin Ala Gly Gin Arg Asp Tyr Asp Leu Leu Val Val Gly 
15 10 15 

Gly Gly Ser Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly 
20 25 30 



30 



EP1 126 026 A1 

Arg Lys Val Ala Val Val Asp Tyr Val Glu Pro Ser Pro Gin Gly Thr 
35 40 45 

Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys lie Pro Lys 
50 55 60 

Lys Leu Met His Gin Ala Ala Leu Leu Gly Gly Leu He Gin Asp Ala 
65 70 75 80 

Pro Asn Tyr Gly Trp Glu Val Ala Gin Pro Val Pro His Asp Trp Arg 
85 90 95 

Lys Met Ala Glu Ala Val Gin Asn His Val Lys Ser Leu Asn Trp Gly 
100 105 HO 

His Arg Val Gin Leu Gin Asp Arg Lys Yal Lys Tyr Phe Asn He Lys 
115 120 125 

Ala Ser Phe Val Asp Glu His Thr Val Cys Gly Val Ala Lys Gly Gly 
130 135 140 

Lys Glu He Leu Leu Ser Ala Asp His lie He He Ala Thr Gly Gly 
145 150 155 160 

Arg Pro Arg Tyr Pro Thr His He Glu Gly Ala Leu Glu Tyr Gly He 
165 170 175 
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Thr Ser Asp Asp II Phe Trp Leu Lys Glu S r Pro Gly Lys Thr Leu 
180 185 190 

Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe Leu Thr 
195 200 205 

Gly lie Gly Leu Asp Thr Thr He Met Met Arg Ser He Pro Leu Arg 
210 215 220 

Gly Phe Asp Gin Gin Met Ser Ser Met Val He Glu His Met Ala Ser 
225 230 235 240 

His Gly Thr Arg Phe Leu Arg Gly Cys Ala Pro Ser Arg Val Arg Arg 
245 250 255 

Leu Pro Asp Gly Gin Leu Gin Val Thr Trp Glu Asp Ser Thr Thr Gly 
260 265 270 

Lys Glu Asp Thr Gly Thr Phe Asp Thr Val Leu Trp Ala lie Gly Arg 
275 280 285 

Val Pro Asp Thr Arg Ser Leu Asn Leu Glu Lys Ala Gly Val Asp Thr 
290 295 300 

Ser Pro Asp Thr Gin Lys He Leu Val Asp Ser Arg Glu Ala Thr Ser 
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305 310 315 320 

Val Pro His He Tyr Ala lie Gly Asp Val Val Glu Gly Arg Pro Glu 
325 330 335 

Leu Thr Pro Thr Ala lie Met Ala Gly Arg Leu Leu Val Gin Arg Leu 
340 345 350 

Phe Gly Gly Ser Ser Asp Leu Met Asp Tyr Asp Asn Val Pro Thr Thr 
355 360 365 

Val Phe Thr Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu Glu Glu 
370 375 380 

Ala Val Ala Arg His Gly Gin Glu His Val Glu Val Tyr His Ala His 
385 390 395 400 

Tyr Lys Pro Leu Glu Phe Thr Val Ala Gly Arg Asp Ala Ser Gin Cys 
405 410 415 

Tyr Val Lys Met Val Cys Leu Arg Glu Pro Pro Gin Leu Val Leu Gly 
420 425 430 

Leu His Phe Leu Gly Pro Asn Ala Gly Glu Val Thr Gin Gly Phe Ala 
435 440 445 
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Leu Gly He Lys Cys Gly Ala Ser Tyr Ala Gin Val Met Arg Thr Val 
450 455 460 

Gly lie His Pro Thr Cys Ser Glu Glu Val Val Lys Leii Arg He Ser 
465 470 475 480 

Lys Arg Ser Gly Leu Asp Pro Thr Val Thr Gly Cys Xaa Gly 
485 490 



<210> 6 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificial 
Synthesized Primer Sequence 

<400> 6 

tacccatacg atgttccaga ttacgct 

<210> 7 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 

» 

<220> 

<223> Description of Artificial Sequence:Artificial 
Synthesized Primer Sequence 

<400> 7 

atactcgagc cctcagcagc ctgtcaccgt 

<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Artificial 
Synthesized Primer Sequence 

<400> 8 

tacccatacg atgttccaga ttacgct 

<210> 9 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificial 
Synthesized Primer Sequence 

<400> 9 

atactcgagc ccacagcagc ctgtcaccgt 

<210> 10 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificial 
Synthesized Primer Sequence 

<400> 10 

atatctagat aagcgccatc cctgcaggcc 

<210> 11 
<211> 30 
<212> DNA 
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<220> 

<223> Description of Artificial Sequence .'Artificial 
Synthesized Primer Sequence 

<400> 11 

gcgtctagac acacttcaga aaaagtaccc 



<210> 12 
<21i> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Artificial 
Synthesized Primer Sequence 

<400> 12 

ataggatcca tgtcatgtga ggacggtcgg 

<210> 13 
<211> 30 
<212> DNA 
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10 



15 



<213> Artificial S quence 
<220> 

<223> Description of Artificial Sequence: Artificial 
Synxhesized Primer Sequence 

<400> 13 

atactcgaga cctcagcagc cagcctggag 30 
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- . -Claims -- - - 

25 1. A DNA comprising the nucleotide sequence as defined in SEQ ID NO: 1 . 

2. A DNA comprising the nucleotide sequence as defined in SEQ ID NO: 

1 , wherein one or more nucleotides are substituted, deleted, added, and/or inserted, that inserts selenocysteine 
for the TGA codon contained in the translational region located upstream of said DNA. 

30 

3. A DNA that hybridizes under stringent conditions to a DNA having the nucleotide sequence as defined in SEQ ID 
NO: 1 , and that inserts selenocysteine for the TGA codon contained in the translational region located upstream 
of said DNA. 

35 4. A DNA that hybridizes to the nucleotide sequence as defined in SEQ ID NO: 1 and that has a chain length of at 
least 15 nucleotides. 

5. A selenoprotein expression vector comprising the following elements: 

40 a) a cloning site.to which a DNA encoding a selenoprotein amino acid sequence can be inserted, 

b) the DNA of any one of claims 1 , 2 and 3 ligated downstream of said cloning site, and 

c) a regulatory region required for protein expression. 

6. The selenoprotein expression vector of claim 5, further comprising a DNA encoding a selenoprotein amino acid 
45 sequence inserted at the cloning site. 

7. A transformant transformed with the selenoprotein expression vector of claim 6. 

8. A method for preparing a selenoprotein comprising the steps of culturing a transformant of claim 7 and collecting 
so the selenoprotein. 

9. A method for preparing a selenoprotein of claim 8, wherein the selenoprotein is an enzyme containing a seleno- 
cysteine in its active center. 

55 10. The method for preparing a selenoprotein of claim 9, wherein said enzyme containing selenocysteine in its active 
center is thioredoxin reductase. 

11. A DNA that is antisense to the DNA of claim 1 or a portion thereof. 
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12. A method of screening for substances that regulate the expression of selenoproteins, comprising the steps of: 

a) bringing candidate compounds into contact with the transformant of claim 7, 

b) measuring the selenoprotein contained in the host cells or the culture supernatant thereof and comparing 
5 it with controls, and 

c) selecting a compound that alters the expression level of selenoproteins by comparison with controls. 
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Figure 1 
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Figure 2 
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Figure 3 



(b) pCMV6myc-TxR II 



ScCys 
UGA 



(c) pCMV6poyc-TxR II A 3UTR [ 



^d) pCMV6myc-TxR U-3VTRAJS. 



ScCys 
UGA 



SeCys 
UGA 



U 



(e) pCMV6myc-TxR HCys3'UTRsensc £ 



Cys 
UGU 



(£) pCMV6myc-TxR I-flag 



esSSBBBMaBaBBaBBI 



SeCys 

UGA 

i 



SeCys 
UGA 



(g) pCWV6mycTxR I-flag-3'UTR 



42 



EP1 126 026 A1 



Figure 4 
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